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Abstract. 

Mass is the fundamental parameter that governs the evolution of stars, brown 
dwarfs, and gas-giant planets. Thus, direct mass measurements are essential to test the 
evolutionary and atmospheric models that underpin studies of these objects. We present 
results from our program to test models using precise dynamical masses for visual bina- 
ries based on Keck laser guide star adaptive optics astrometric monitoring of a sample of 
over 30 ultracool (> M6) objects since 2005. In just the last 2 years, we have more than 
tripled the number of late-M, L, and T dwarf binaries with precise dynamical masses. 
For most field binaries, based on direct measurements of their luminosities and total 
masses, we find a "temperature problem" in that evolutionary model radii give effective 
temperatures that are inconsistent with those from model atmosphere fitting of observed 
spectra by 100-300 K. We also find a "luminosity problem" for the only binary with an 
independent age determination (from its solar-type primary via age-activity-rotation 
relations). Evolutionary models underpredict the luminosities of HD 130948BC by a 
factor of «2, implying that model-based substellar mass determinations (e.g., for di- 
rectly imaged planets and cluster IMFs) may be systematically overestimating masses. 
Finally, we have employed the current sample of binary orbits to carry out a novel test 
of the earliest evolutionary stages, by using the distribution of orbital eccentricities to 
distinguish between competing models of brown dwarf formation. 



1. Introduction 

Binary systems have long been used to probe both the inner workings and the origins 
of stars. In multiple systems the laws of gravity can be used to infer the masses of stars. 
Mass is the fundamental property of any object as this largely determines its entire 
life history, and thus binaries with direct dynamical mass measurements are important 
benchmarks for testing theory. Binaries also record within their orbits a dynamical im- 
print of their formation and subsequent evolution. For a given mass, the total energy of 
the system is related solely to the semimajor axis of the orbit (E = -GM/2a), while the 
angular momentum depends on the orbital eccentricity (L = ^GMa{\ - e 2 )). Decades 
before it was known that nucleosynthesis powers stars, well-determined binary orbits 
were commonplace (e.g., > 200 orbits were published in the compilation of lAitkerJ 
Il918h and were, for exampl e, used to argue for a dynamical constraint on the age of 
the Galaxy of < 10 Gyr (Am bartsumian Il937l) as opposed to the ~ 10,000 Gyr age 
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limit supported by [Jeansl (1935). Thus, the study of binary orbits has a rich heritage, 
predating and defining much of modern astrophysics. 

It has only recently been possible to extend such studies below 0.1 Mq, to masses 
at the bottom of the main sequence and into the brown dwarf regime. The field pop- 
ulation over this mass range encompasses spectral types > M7, collectively referred 
to as "ultracool" owing to their shared low-temperature atmospheric physics. Only 
wlO years ago were the first larg e samples of ultracoo l dwarfs identi fied via wide- 
field surveys such as 2 MASS (e.gjBurgasser et al.l ll999h. DENIS (e.g jDerfosse et all 
19991) . and SDSS (e.g jHawlev et alT2 002). The underlying binary populati on was sub 



sequently uncovered by hig h-resolution imaging surveys using HST (e.g., Bou v et al 
2003 L IB urgasser et al.l20 03) and adaptive optics (AO) from the ground (e.g., Clos e et al 



20021 : iLiu & L eggett 2005). In the first half of the last decade, orbital monitoring 



of a few sele ct binaries yielded three dynamical mass measurements: LH S 1070BC 

(M8.5+M9 5:lLeinert et al.l200ll) : Gl 569Bab (M8.5+M9: ILane et al.l200~lh : and 2MASS J0746+2000AB 

(L0+L1.5: iBouy et al.ll2004l) . These first direct mass constraints on ultracool models 

probed relatively warm tempera tures (> 2200 K) and high masses (only Gl 569Bb is 

likely to be a brown dwarf; iDupuy etaL 20101) . 

Over just the last three years the number of dynamical masses sufficiently pre- 
cise for meaningful model tests (< 30%) has tripled, with components now extending 
down to masses of «30 Mj up and temperatures of wlOOO K. This rapid expansion in 
dynamical masses has been driven by high-precision orbital monitoring programs that 
utilize the relatively new capability of laser guide star (LGS) AO to resolve the shortest 
period binaries from ground-based telescopes. Our program using Keck LGS AO has 
been ongoing since 2005 and has provided five of the six new high-precision dynamical 
masses. All nine ultracool binaries with high-precision dynamical masses are listed in 
Table 1. In the cases where multiple orbits have been published for a binary, the param- 
eters from the highest quality orbit determination are given; previous orbit references 
are listed in parentheses. 



Table 1. Ultracool visual binaries with precise dynamical masses (cr M /M < 30%) 



Name 



Component 
Sp. Types 



Period 
[days] 



Eccentricity 



Total mass 
[Mq] 



Ref. 



Gl 569Bab 
LP 349-25AB 
HD 130948BC 
2M0746+20AB 
LHS 2397aAB 
2M1534-29AB 
LHS 1901AB 
LHS 1070BC 
2M2206-20AB 



M8.5+M9 
M7.5+M8 
L4+L4 
L0+L1.5 
M8+L7 
T5+T5.5 
M6.5+M6.5 
M8.5+M9.5 
M8+M8 



864.5 ±1.1 
2834+ 15 
3760 + 60 
4640 + 30 
5190 + 40 
5500+ 800 

JJU -600 

5880+ 180 
6214.7 + 0.4 
12MXCS" 



0.316 + 0.005 
0.051 +0.003 
0.176 + 0.006 
0.487 + 0.003 

0.350 ± 0.005 

25+o.n 

0.830 ± 0.005 
0.034 + 0.002 
0.25 + 0.08 



140+0.009 

120+™ 

0.109 + 0.002 

0.151+0.003 

0.146 + 0.014 

0.056 + 0.003 
1 94+0.025 

U - Ly -0.021 

0.157 + 0.009 

15+ 005 



1 (2-5) 
1(5) 

6 (5,7) 
5(8) 
9(5) 
10(5) 
1 

11(12) 
13(5) 



Refere nces. — (11 lDupuyetal.1 (12010)): (2) lLane et al.1 (1200 111 
(2004); (4) ISimonetalJ (120061): T sT TKonopackv et all <f2o7oh : 



revis ed); (7) [D upuy et al 



(10) ILiu. Dupuv & I reland 
(131 lDupuy et all (l2009al) 



(8) Bouyetal 



(11) ISeifahrt et al.l 



(3) |Zapatero Osorio et ail 
(6) Dupuy et a l (2011 , 
(9) IDupuy et alJ (12009c): 
(12) lLeinertetal.1 12001): 
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2. Testing Evolutionary and Atmospheric Models 

The growing sample of ultracool dwarf dynamical masses has enabled some of the 
strongest tests of theoretical models to date, particularly in the brown dwarf regime. 
Because brown dwarfs do not have a sustained source of internal energy generation, 
they simply grow colder and fainter over time. Thus, the sample of field brown dwarfs 
spans a wide range of masses at a given luminosity, and the only means of pinpointing 
a given brown dwarf's evolut ionary status is to measure its mass or age. As shown by 
Liu. Dupuy. & Ireland! d2008h . fundamental properties such as T e fi and log(g) are «5x 



better constrained (i.e., model tests are «5x stronger) with dynamical mass measure- 
ments as compared to age determinations, which are ordinarily of lower precision. 

One of the major components of our work has been to develop an analysis method 
that produces rigorous, quantitative tests of models based solely on directly measured 
properties (i.e., mass and luminosity). This is of fundamental importance, as other pre- 
vious approaches were sometimes based on r e ff estimates or the model-predicted color- 
magnitude dia gram, both of whic h are often prone to severe systematic b iases (e.g., see 
discussions in lGizis & Reidll2006l and Section 4.5 of iDupuy et al.ll2010l) . The discrep- 



ancies we have found between observations and models to date fall into two categories: 
(1) problems with the luminosities predicted by evolutionary models, (2) problems with 
temperature estimates derived from model atmospheres and/or from evolutionary model 
radii. 



2.1. "Luminosity Problem" 

Conventional wisdom is that the most robust predictions of substellar models are bulk 
properties such as the radius and bolometric luminosity (Lt> i) since independent models 
put forth by different groups produce nearly identical (to within a few percent) values 
at ages beyond a few Myr. Consequently, evolutionary models have become trusted to 
provide accurate mass estimates in the many situatio ns when luminosity and age are 
well constrained, such as cluste r mass functions (e .g. jLodieu et al.ll2007r) and directly 
imaged extrasolar planets (e.g.. iMarois et al]|2008h . However, until recently no direct 
measurements of luminosity and mass were available for any brown dwarfs that also 
had well determined ages. 

The first, and to date only, system that ena bles a direct test of s ubstellar luminosity 
evolution is the L4+L4 binary HD 130948BC dDupuv et al.ll2009bl) . Its model-derived 



age (given its measured total mass and component luminosities) is significantly younger 
than the age of the primary G2V star in this triple system. This is a 2o~ discrepancy 
when accounting for the errors in the binary's measured m ass and luminosities and 
the gy rochronology relation used to age-date the primary star dMamajek & Hillenbrand! 
2008). If we assume that the stellar age and its error are correct (i.e., HD 130948A is 
not simply an outlier as compared to the cluster rotation sequences), then the luminosi- 
ties predicted by evolutionary models for HD 130948B and C are a factor of «2 lower 
than what is observed. In considering this potential luminosity problem, we will now 
address the following questions: (1) is there additional evidence for models underpre- 
dicting luminosities of substellar objects; and (2) what physical mechanism could be 
responsible for this effect? 

(1) Is there additional evidence for the luminosity problem? If models underpredict 
luminosities, this would manifest as either model-derived masses (based on L\, \ and 
age) that are ~25% too high or model-derived ages (based on mass and Lboi) that are 
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Figure 1. Left: Relative astrometry of the L4+L4 binary HD 130948BC from 
HST, Gemini, and Keck AO imaging along with the best-fit orbit. Right: Given the 
measured dynamical mass, evolutionary models predict the luminosity of the com- 
ponents of HD 130948BC as a function of age (colored lines; widths indicate the 
2.0% mass error propagated through the models). The data point shows the mea- 
sured luminosity of HD 130948B at the age determined from age-activity-rotation 
relations for the primary star in this triple system (0.79+" '15 Gyr). Both HD 130948B 
and HD 130948C are a factor of «2 more luminous than predicted by models. 



too young. The former problem can be te sted in the multiple planet system around 
HR 8799. IFabrycky & Murray-Clayl (1201 Ol) have shown that there only two plausible 
scenarios for the three tightly packed, massive planets (model-derived masses of 7, 10, 
and 10 Mj up ) to be dynamically stable: (1) the inner planets must be in 2:1 resonance, 
and all three planets must have masses lower than predicted by models by at least 20%- 
30%; or (2) both the inner pair and outer pair of planets must be in 2:1 resonances, in 
which case the masses of the planets could be as much as a factor of ~2 larger. The 
careful construction of the system required in the latter case may suggest that the first 
scenario is in fact more plausible, and the "luminosity problem" may also be seen here. 

Further evidence for the luminosity problem can also come fro m objects with 
model-derived ages that appear to be too young. iDupuy et all ( 20101) have observed 
this in the M7.5+M8 binary LP 349-25AB for which Lyon models give an age of 
130 ± 20 Myr. This is essentially identical to the canonical Pleiades age of 125 Myr as 
determine d from the lithium depletion boundary independent of model luminosity pre- 
dictions (IStauffer et al.f l998). However, LP 349-25AB appears to be sig nificantly older 
than the Pleiades given the strong upper limit on its Li 1 abundance dReiners & Basri 
2009), and it also does not show spectros copic signatures of youth as is normally seen 
in young (-100 Myr) field dwarfs (e.g.. lMcGovern et al.ll2004l : lAllers et al.ll2007l) . 

(2) What physical mechanisms could explain the luminosity problem? After a 
few Myr of evolution, model cooling tracks that a dopt very different i nitial conditions 
converge to essentially the same luminosity (e.g., Bara ffeet al.ll2002l) . so accounting 
for different formation histori es such as in "cold start" models with radiative energy 
losses due to accretion (e.g., iMarley et al.l l2007h will not alter model cooling tracks 
sufficiently to explain our observed luminosity discrepancy. Rather, it is more likely 
that a process su stained over these objec t s' life times, such as convection, is responsible 
for the problem. iMullan &" MacDonaldl d2010l) have recently constructed evolutionary 
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models tailored to the components of HD 130948BC in which the onset of convec- 
tion is inhibited by very strong magnetic fields throughout the interior. This effect can 
substantially alter the radius evolution of the components of HD 130948BC a nd thus 
explain the observed discrepancy on the H-R diagram reported by lDupuy et al.1 (2009b) 
because their new models yield larger radii, which shifts model tracks to lower J e ff . 
However, at the same time these models also lower the model-predicted luminosities 
by w0.2 dex («50%), increasing the observed luminosity discrepancy. 

If the inhibition of the onset of convection has the opposite effect on t he luminosity 



evolut ion, then perhaps the efficiency of convection must be changed. Chabrier et al. 



(2007) consider the impact of strong interior magnetic fields and fast rotation on the 
convective energy transport in low-mass stars. They find that these processes result in 
lowered convective efficiency, which in turn results in bloated radii for «0.3-0.8 Mq 
stars, while lower mass fully convective stars are much less affected. The effect of re- 
duced convect i ve en ergy transport for brown dwarfs is not investigated in detail, but 
Chabrier et al.l (l2007h point out that its main effect will be to slow their contraction rate. 



This may effectively "slow down the clock" of substellar evolution, i.e., for a given 
mass and age objects would be larger and more luminous than predicted by conven- 
tional evolutionary models that do not include the effects of convective inhibition due 
to rotation or strong interior magnetic fields. 



2.2. "Temperature Problem" 

Model atmospheres offer the possibility of determining the temperature and surface 
gravity of an object from a single spectrum. This approach is widely used in the study 
of very low-mass stars, brown dwarfs, and directly imaged planets. However, the re- 
liability of such estimates (and the conclusions drawn from them) depend entirely the 
fidelity of the model atmospheres being used. This can only be assessed in special 
cases where independent constraints on the mass, age, or radius are available. We have 
utilized our measured dynamical masses of ultracool field dwarfs to provide some of 
the strongest tests of model atmospheres to date. This is made possible by the fact that 
direct mass measurements effectively pin down the evolutionary status of a field object 
to a degree of certainty commensurate with the precision of the measured mass and 
luminosity (both typically < 10%). The values of T e g and log(g) then inferred from 
evolutionary models are limited to a very narrow range, typically exceeding the preci- 
sion of the model atmosphere grid steps of 50-100 K and 0.5 dex. This evolutionary 
model-derived T e g is essentially a restatement of the model radius given the measured 
luminosity, following the definition of effective temperature, r e ff = (L\, i/4nR 2 o-) 1 ^ 4 . 
Therefore, these binaries provide precise temperature benchmarks that can be com- 
pared to results from model atmosphere fitting. 

We have found that temperatures derived from model atmospheres are typically 
inconsistent with those from evolutionary models at the level of « 100-300 K. The 
amplitude and the sign of this disagreement vary over the full range of our sample. 
For example, T dwarf model atmosphere fits give ~ 100-200 K warmer temperatures 
than evolutionary model radii ( Liu et al1l2008l). but L d warf model atmosphere fits give 



^ 200-300 K cooler temperatures (Dupuy et al.ll2009bT) . The lack of a systematic offset 
for all objects indicates that the disagreement is not simply due to a uniform error in 
predicted radii at all masses and spectral types, but without directly measured radii, 
which would provide T e ff directly, it is impossible to distinguish whether evolutionary 
or atmospheric models are responsible for the observed discrepancies. 
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Figure 2. Late-M dwarf effective temperatures determined from model atmo- 
sphere fitting compared to Lyon Dusty evolutio nary model-derived T e ff, (i.e., from 
measured total mass and individual luminosities Du puy et~aT]|2010t) . Four indepen- 
dent model atmosphere grids were tested (left to right) and two different methods of 
fitting atmospheres: fitting the full 0.95-2.42 /mi near-infrared spectrum (top pan- 
els), and fitting the Y, J, H, and K bands individually (bottom panels). In all cases, 
an offset of 250 K between the two classes of models is observed (dashed line). This 
implies that either evolutionary model estimates are too cool (i.e., radii too large by 
15%-20%) or that atmospheric model estimates are too warm by 250 K. The latter 
is more likely given that this sample spans a wide range of masses, ages, and activity 
levels, but a narrow range of T e n ■. 



If we restrict our comparisons to a narrow range of temperatures, we can essen- 
tially remove one variable from the problem and focus on how the T e tf discrepancies 
vary with mass and age. To date, the largest subset of mass benchmarks at any given 
temperature are the late-M dwarfs (Table 1). We obtained integrated-light near-infrared 
spectroscopy of four nearly equal-flux late-M binaries with component mass determi- 
nations ranging from w0.06-0.10 Mq (i. e., young brown dw arfs to old stars) and with 
a range of chromospheric activity levels (Dupuy et al.ll2010h . After fitting these spectra 
with four independent model atmosphere grids, we found that all best-fit temperatures 
were systematically «250 K warmer than the values derived from evolutionary 
models (Figure 2). Thus, the observed T e g discrepancy is the same for objects with 
very similar spectra but with very different masses, ages, and activity levels, indicating 
that the model atmospheres are largely responsible for the observed T e tf discrepancy. 



3. Testing Formation Models with Eccentricities 



Some properties of very low-mass binaries, such as orbital semimajor axes, mass ratios, 
and multiplicity fractions, can be studied in a sta tistical fash ion using a large sample 
without any knowledge of orbital motion (e.g.. lAUenl 120071) . and such studies have 
provided very important constraints on formation models at very low masses. However, 
only well-determined orbits can provide eccentricities (e), which are directly related 
to the angular momentum of the formation process. In fact, the only two theoretical 
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Figure 3. Eccentricity distribution of all very low-mass binaries with well- 
determined orbits, including both visual and spectroscopic binaries (top panels). The 
ob served distri bution is very similar to that predicted by the cluster formation model 
of iBatd ([2009, bottom left) but is inconsistent wi th the much higher eccentricities 
predicted by the gravitational instability model of Stamatellos & Whitworth (2009, 
bottom right), even after applying a bias correction to the observed distribution. 



simulations that report eccentricities of very low-ma ss bi n aries predict very different 
types of orbits: (1) the cluster formation model of iBatel (|2009h yields a total of 16 
binaries with typically modest eccentricitie s, and (2) the disk gravitational instability 
model of Stamate llos & Whitworth] ( 20091) produces 12 binaries with typically high 
eccentricities (Figure 3). 

The sample of very low-mass binaries with well-determined orbits is now of com- 
parable size to the binaries produced in theoretical simulations, and thus it can be used 
to try to discriminate between the very different model predictions. Figure 3 shows the 
eccentricity distribution for all very low-mass binaries with eccentricity uncertainties 
less than 0.1. This excludes the visual binary 2MASS J1534- 2952AB (Table 1) an d 
includes three spectroscopic binaries: P P1 15 (e = 0.42 + 05; lBasri & Martmlll999l) . 
2MASS J0535-0546 (e = 0.323 +0.006 ; IStassun et al.ll2006l) . and 2MASS J0320-0446 
(e = 0.065 + 0.016; Blak e et al.ll2008l) . This sample populates a broad range in ec- 
centticity from nearly circular to highly eccentric (e = 0.03-0.83) and shows a strong 
preference for modest eccentricities (e = 0.35, median of 0.32). The observed distribu- 
tion of eccentricities is well-matched to that predicted by the cluster formation model of 
112 {2009), although we note that this simulation is not in full agreement with observa- 
tions as it produces much wider binaries than are observed in the fi eld. In contrast, the 
gravitational instability model of IStamatellos & Whitworthl (12009b produces too many 
high-e binaries, with an absence of any modest eccentricity (e < 0.3) orbits, which is 
highly inconsistent with the observed distribution. 
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